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1 Introduction 

Rational approximation to a given function on a curve in the complex plane is a classical topic from analysis, 
and a cornerstone of modeling and design in several areas of applied sciences and engineering. Special interest 
attaches to the case where the approximated function extends holomorphically on one side of the curve. In 
connection with system identification and control, such issues typically arise on the line or the circle where 
they make contact with extremal problems in Hardy spaces [4, 15, 33, 29, 30, 32, 43]. Our model curve in 
this paper will be the circle, though everything translates easily to the line. The criterion under examination 
will be the L 2 -norm. 

From the approximation-theoretic viewpoint, much attention has been directed towards error rates, in 
connection with smoothness of the approximated function. Let us mention Peller’s converse theorems on 
the speed of rational approximation [33], Glover’s construction of near-best uniform rational approximants 
[15], Parfenov’s solution of a conjecture by Gonchar on the degree of rational approximation to holomorphic 
functions on compact subsets of the domain of analyticity [31], the Gonchar-Rakhmanov estimates in uniform 
rational approximation to sectionally holomorphic functions off an S'-contour, and its generalization to best 
L 2 and L p approximants in [8, 42]. 

The present paper is, in part, a sequel to [8]. In the latter reference best L 2 and L°° rational approximants 
are compared in the n-th root sense, whereas here we compare them in norm. We emphasize that the L 2 
norm and weighted variants thereof are of great importance in applications, due to their interpretation as a 
variance in a stochastic context. Moreover, best rational H 2 approximants have the interesting property of 
being attained through interpolation [26]. Note also that certain functions, like Blaschke products, can be 
approximated in U 2 -norm but not in the uniform norm by rational functions. 

A key to the above-mentioned comparison is the derivation of lower bounds on the L 2 approximation 
error. Lower bounds in approximation are usually difficult to obtain; we dwell here on a topological machinery 
developed in [6] which expresses the approximation error as the solution to a max — min problem, and we 
rely as well on the Adamjan-Arov-Krein theory of best uniform meromorphic approximation. We prove a 
somewhat general result (Theorem 4) which gives a lower bound on the L 2 -best rational approximation error 
of given degree, in terms of the ratios of L 2 and L°° norms of the singular vectors of the Hankel operator 
with symbol the approximated function. We then apply it to three cases where these ratios can be estimated: 
rational functions, Blaschke products, and Cauchy integrals on geodesic arcs. We use also the max —min 
principle to study linearized errors as a means to compute further lower bounds. We also include numerical 
experiments, some of which give excellent accuracy to estimate the H 2 error in rational approximation (see 
Table 1 in Section 7). To the author’s knowledge, such results are first of their kind. 

The paper is organized as follows. After some preliminaries on Hardy spaces in Section 2, we present in 
Section 3 the approximation problems that we consider. Section 4 is an introduction to the results of [6] and 
it contains a basic account of the Adamjan-Arov-Krein theory. We derive in Section 5 our main theorem 
giving lower bounds in L 2 rational approximation, and we apply it to cases mentioned above. Finally, in 
section 6, we discuss linearized errors. 

2 Notations and Preliminaries 

Let B be the unit disk in the complex plane C, and T the unit circle. We denote by C(T) the space of 
continuous, complex-valued functions on T. For 1 < p < oo, we put L p = L P (T) for the familiar Lebesgue 
space of complex measurable functions on T such that 



Hereafter, we let H 2 = H 2 { B) be the Hardy space of holomorphic functions in B whose Taylor coefficients 
at 0 are square summable: 


H 2 = {f(z) = Z%L 0 a k z k : \\f\\ H i := S^oKI 2 < +oo}. 
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We refer the reader to [14] for standard facts on Hardy spaces. By Parseval’s relation 

1 r 27T 

||/||^ 2 = sup — / \f(re l9 )\ 2 dd, (1) 

0<r<l ^ JO 

and the map 

(/(*) = X%L 0 a k z k ) —► (/*(e lS ) := ^ = 0 a k e ike ) 

is an isometry from H 2 onto the closed subspace of L 2 comprised of functions whose Fourier coefficients 
of strictly negative index do vanish. As is customary, we shall identify H 2 with this subspace so that the 
distinction between / and f* as well as ||/||_f /2 and ||/*||2 will disappear. This conveniently allows one to 
regard members of the Hardy class both as functions on D and on T. From the function-theoretic viewpoint, 
the correspondance f f* is that /*(e* e ) is almost everywhere the limit of f(z) as z tends non-tangentially 
to e l8 within D. 

We put H 2,0 = H 2,0 ( C \ D) for the companion Hardy space of holomorphic functions in C \ B, vanishing 
at infinity, whose Taylor coefficients there are square summable: 

H 2 ’° = {/(*) = EgLi a k z~ k : \\f\\ m , 0 := E£° =1 |a fc | 2 < +oo}. 

The map 

(m = sr =1 «^~- fc ) —► (/v 9 ) = ^ iake - M ) 

is an isometry from H 2,0 onto the closed subspace of L 2 comprised of functions whose Fourier coefficients of 
non-negative index do vanish, and as before we identify H 2,0 with the latter. Clearly we have an orthogonal 
sum: 

L 2 = H 2 ® H 2 ’ 0 . (2) 

In fact, it holds that / £ H 2,0 if and only if the function / given by 

/0) := z~ l f(l/z) (3) 

lies in H 2 , and the map f >-> f is an involutive isometry of L 2 sending H 2 onto H 2 ’ 0 . Actually, / has same 
modulus as / pointwise on T since f{\/z) = f(z) when | 2 | =1. If / is holomorphic on H, then P(z) = 
is holomorphic on the reflection of H across T, and if / is rational /** is likewise rational. Of course, a relation 
like P = f must be understood to hold on T only. 

We let 

P+ (E feez a k e ike ) = E fe > 0 a k e lke and P_ (E keZ a k e lke ) = E k<0 a k e ike 

indicate the so-called Riesz projections that discard the Fourier coefficients of strictly negative and non¬ 
negative index respectively. Clearly P + (resp. P_) contractively maps L 2 onto H 2 (resp. H 2,0 ) and 
P+ + P_ = I. We call P_|_ the analytic projection and P_ the anti-analytic projection. Note that, by 
Cauchy’s formula, P±(/) can be expressed as Cauchy integrals: 

w<1 ' p - mz) =P |2|>l < 4 > 

The Hardy space H°° = H°° (D) consists of bounded holomorphic functions on B, endowed with the sup 
norm. From (1) we see that H°° embeds contractively in H 2 , in particular each / £ H°° has a non-tangential 
limit /* on T. It can be shown that ||/*||oo = ll/lli? 00 , an d that the map />—>■/* is an isometry from H°° 
onto the closed subspace of L°° comprised of functions whose Fourier coefficients of strictly negative index 
do vanish. Again we identify H°° with this subspace. Likewise, the space H°°’ 0 of bounded holomorphic 
functions vanishing at infinity in C \ B identifies via non-tangential limits with the closed subspace of L°° 
consisting of functions whose Fourier coefficients of non-negative index do vanish. However, in contrast with 
the situation for L 2 , the operators P± are unbounded on L°°. Besides the norm topology, H°° inherits 
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the weak-* topology from L°°(T). It is characterized by the fact that f n tends weak-* to / if and only if 
fj fnP — > f T ftp for every p> £ L 1 . It is equivalent to require that (||/ n ||oo)n is a bounded sequence and that, 
for each k , the fc-th Fourier coefficient of f n converges to the fc-th Fourier coefficient of /. 

As is well-known [14, ch. II, cor. 5.7], a nonzero / £ H 2 factors uniquely as / = jw where 

«,W = exp{i-^ 2 'J±ilog|/ ( e -«)|rfe} (5) 


belongs to H 2 and is called the outer factor of /, normalized so as to be positive at zero, while j £ H°° has 
modulus 1 a.e. on T and is called the inner factor of /. The latter may be further decomposed as j = bS, 
where 


b(z) 


=n 

0#o 


~C i 

101 


z-Cl 
1 - ClZ 


( 6 ) 


is the normalized Blaschke product , with multiplicity k > 0 at the origin, associated to a sequence of points 
Ci £ B \ {0} and to a constant c £ T, while 


si ‘ )=e * p {-hl 

is the singular inner factor associated with a positive singular measure p on T. The Cl are of course the zeros 
of / in D, counting multiplicities by repetition. The number of zeros, finite or infinite, is called the degree 
of the Blaschke product. Throughout, we let B n denote the set of Blaschke products of degree at most n. If 
the degree is infinite, the convergence of the product in (6) is equivalent to the condition 

^(i - 101) c OO (7) 

i 

which holds automatically when / £ H 2 . That w(z) is well-defined rests on the fact that log |/| £ L 1 
if / £ H 2 \ {0}D. A function / £ H 2 with inner-outer factorization / = jw lies in H°° if, and only if 
w £ L°°(T). For simplicity, we often say that a function is outer (resp. inner) if it is equal to its outer (resp. 
inner) factor. 

We put T n [z\ for the space of complex algebraic polynomials of degree at most n in the variable z, or 
simply V n if the variable is understood. Below we let Z(q) indicate the set of zeros of a polynomial q. For 
q n £ V n [z] , we define its reciprocal polynomial to be 

q n (z) := z n q n (l/z). 


We warn the reader that this definition depends on n: if we consider q n -i £ V n -i as an element of V n with 
zero leading coefficient, the definitions of q n -i{z) in V n -\ and in V n may be inconsistent. Therefore we 
always specify, e.g. via a subscript “n” as in “q n ”, which definition is used. Clearly the “tilde” operation is 
an involution of V n preserving modulus pointwise on T. 

We designate by 7 Z m , n = 7Z-m. n (z) the set of complex rational functions of type (m,n) in L 2 , namely 
those that can be written as p m /<ln where p m belongs to V m and q n £ V n has no root on T. When r = p m /gn 
is in irreducible form, the integer max{m, n} is the (exact) degree of r. Note that B m C V m ,m is comprised 
of rational functions of degree at most m which are analytic in ID and have unit modulus everywhere on T. 
Alternatively, B m consists of functions q m /qm where q m £ V m has all its roots in D. Clearly, B m is included 
in the unit sphere of both H 2 and H°°. 

We further set 

Hm '■= {— : g £ H 2 , q m £ V m }- 
Qm 

Members of H 2 n identify in L 2 with non-tangential limits of meromorphic functions with at most to poles in 
D (counting multiplicities) whose L 2 -means over {|z| = r} remain eventually bounded as r W. Functions 
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in U m H^ are called meromorphic in L 2 . Two equivalent descriptions of H ^ are useful: on the one hand 
we get by pole-residue decomposition that H ^ = H 2 + ( lZm-i,m H H 2, °), on the other hand we have that 
H ^ = I?" 1 # 2 , the set of quotients of H 2 - functions by Blaschke products of degree at most m. Likewise we 
put 

n L°° = Bfi^H 00 = {— : g £ H°°, q m £ Vm} 

Qm 

for the set of meromorphic functions with at most m poles in L°°. 


3 Best rational and meromorphic approximation in L 1 

For n > 1 an integer, the best rational approximation problem of degree n in L 2 is: 
Problem R(n): Given h £ L 2 , to find r* £ TZ n ,n such that 

\\h-r*\\ 2 = min \\h - r\\ 2 . 

r£/Zn,n 


Write h = hi + h 2 with hi £ H 2 , h 2 £ H 2,0 . By partial fraction expansion, each r £ lZ n , n can be 
decomposed as n + r 2 where rq £ H 2 , r 2 £ H 2 ’ 0 , and deg ri + deg r 2 < n. Then, by (2), 

\\h~r\\l = \\hi-n\\l + \\h 2 -r 2 \\l 

so that problem R(n) reduces, modulo optimal allocation of the degrees of ri and r 2 (n + 1 choices), to a 
pair of problems of the following types: 

Problem RA(n): Given f £ H 2 , to find r* £ TZ n , n H H 2 such that 

||/-r*[| 2 = min II/ —r|| 2 . 

r£lZ n , n nH 2 

Problem RAB(n): Given f £ R 2,0 , to find r* £ lZ n - i >ra fl H 2,0 such that 

\\f~r*\\ 2 = min ||/-r|| 2 . 
re7?.„_i, n ni/ 2 .o 

In “RA(n)“ and “RAB(n)”, the letter ”A” is mnemonic for “analytic” and ”B” stands for “bar”. 
Problem RA(n) is in fact equivalent to RAB(n). For we can parametrize r £ TZ n , n f~l H 2 as r(0) + zr% 
where r(0) £ R and r% £ lZ n -i^ n fl H 2 vary independently, and by Parseval’s theorem 

\\f-r\\ 2 2 = \f(0)-rm 2 + m~fm^zr 3 \\l 

hence r(0) = /(0) is the optimal choice. Thus, since multiplication by 1/z is an isometry, we find upon 
replacing / by (/ — f(Q))/z that Problem RA(n) is equivalent to the normalized version: 

Problem RAN(n): Given f £ H 2 , to find r* £ TZ n -i^ n fl H 2 such that 

11/ — r*|| 2 = min ||/-r|| 2 . 

r£U n -i,„r\H 2 

Now, applying the check operation defined in (3), which preserves TZ n -i^ n and the degree, this last 
problem is seen to be equivalent to RAB(n), as announced. Note that when passing from RA(n) to RAB(n), 
the initial / £ H 2 to be approximated from 7 Z n ^ n nH 2 gets transformed into the function f(l/z) — f(0) £ H 2,0 
to be approximated from lZ n -i,n H H 2,0 . Finally, we state the best meromorphic approximation problem 
with at most n poles in L 2 : 

Problem MA(n): Given / £ L 2 , to find g* £ H 2 such that 

\\f -9*h = min ||/-g|| 2 . 
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Problem MA(n) is also equivalent to RAB(n). Indeed, H 2 = H 2 + (lZ n -i tn nH 2 ’ 0 ) so that, by orthogonal¬ 
ity of H 2 and H 2, °, the 7L 2 -component of a minimizer in MA(n) must be P +(/) while the // 2,0 -component 
of this minimizer is a solution to RAB(n) with / replaced by P —(f). 

Let us mention that best meromorphic approximation, unlike best rational approximation, is conformally 
invariant. This makes it of independent interest in a broader context, see [5, prop. 5.4] for further details. . 

Having reduced all previous approximation problems to RAB(?r), hereafter we discuss the latter. It is 
known that RAB(n) has a solution which needs not be unique, and every solution has exact degree n unless 
/ is rational of degree at most n — 1 [12, 26, 2]. 

We shall write d, 2 (f,Tln-i,n) (resp. d, 2 (f,'R- n ,n)) for the distance from / to TZn-i^ (resp. lZ n ,n) in L 2 . For 
instance if f £ H 2, °, then d, 2 (f,lZ n -i,n) is both the value of Problem RAB(n) and of Problem MA(n); and 
if f £ H 2 , then d 2 (f,'R. n - i, n ) (resp. (h{f, TZ n , rl )) is the value of problem RAN(n) (resp. RA(n)). Besides, 
the value of MA(n) is denoted by d 2 (f,H 2 ). 

When / £ L °°, we let d 00 (f,H^ ) ) indicate the distance from / to Hff. This is the value of the best 
meromorphic approximation problem with at most n poles in L°°, that we did not formally introduce but 
which stands analog to MA(n) with L 2 replaced by L°° and H 2 by Hff. We put also g?oo(/, B-n-i,n) (resp. 
doo(f,Tln,n)) for the distance from / to TZ n -i^ n (resp. lZ n .n) in L°°. 


4 Duality in meromorphic approximation 

Pick / £ H 2,0 and let us parametrize r £ lZ n -i, n fl H 2,0 as r = p n -i/qn where p n -i ranges over P n -\ and 
q n ranges over those polynomials in V n whose roots lie in ID. Then q n /q n £ B n and since p n -i/q n £ H 2 we 
have by orthogonality of H 2 and H 2,0 that 

11/ - ^111 = II ft - = l|P-(/^)lli + I|p + (/^) - ^lli- (8) 

Qn Qn Qn Qn Qn Qn 


Clearly the product of a W 2,0 -function by a polynomial in V n yields a member of z n H 2,0 . Therefore 

q n P+(f^) = fq n - qnP-(f^) £ z n H 2 ^ 0 n H 2 = P„_r, (9) 

Qn Qn 

entailing that p n -i = q„P+(/ q n /qn) is the minimizing choice in (8) for fixed q n . Consequently 


min || / — 'T’ll 2 — min 

r£lZ n -i, n nH 2 .° q n €Vr,,Z(q n ) ci 


-(/f^lh = min ||P_(/6„)|| 2 . 
q n b n £B n 


( 10 ) 


That the infinrum is indeed attained in the right hand side of (10) follows from (8) and the fact that RAB(n) 
has a solution. Define Af , the Hankel operator with symbol /, by 


Af : H°° —> H 2 ’ 0 

v i-)- P ~(fv). 


( 11 ) 


It is evident that Ay is continuous and that |||Ay||| = ||/|| 2 , a unit maximizing vector being v = 1. Here 
and below, we let |||.||| stand for the operator norm, and a maximizing vector of an operator A is a nonzero 
vector v such that ||i?u||/||w|| = |||-E’|||. 

The content of the discussion leading from (8) to (10) may now be restated as follows. 

Proposition 1 For f £ H 2 ’ 0 , it holds that 


d 2 (f 1 Kn-i,n) = min ||A / (5„)|| 2 . (12) 

b n £ B n 

A rational function p n -\/q n £ lZ n -i tn is a solution to RAB(n) if, and only if b n = q n /qn is a minimizing 
Blaschke product in (12) and p n -1 = q n P+(fb n ). 
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Put Ck for the space of linear operators from H°° into H 2,0 which are weak-* continuous and have rank 
not exceeding k. For k = 0,1, 2,we denote by a k (Af) the fc-th approximation number of Af defined by 

a k (A f )=mi{\\\A f -T\\\, TeC k .}. (13) 

Note that a k (Af) > a k +i(Af) and that <j 0 (Af) = |||A/|||. 

We need also introduce the genus of a closed symmetric subset K in a topological vector space; here, 
symmetric means that if v £ K then also — v £ K. By definition the genus of K , denoted by gen(K), is the 
smallest positive integer m for which there exists an odd continuous mapping 

G : K —> R m \ {0}, (14) 

or else +00 if no finite m meets the above requirement. By convention the genus is zero if K = 0. When 
K is compact and does not contain 0, then gen(i’T) is always finite, see [44]. For instance, if m > 1, the 
classical Borsuk-Ulam theorem from topology [20, ch. 2, sec. 6] implies that any symmetric set in which 
is homeomorphic to the (real) (to — l)-dimensional Euclidean sphere through an odd map has genus 

TO. 

Below, we shall be concerned with weak-* compact subsets of S°°, the unit sphere of H°°. In this 
connection, we let 

K.™ = {K C S°° : K is a weak-* compact symmetric subset of S°° with gen(iF) > to}. 
Subsequently, we define the (generalized) singular numbers of Af by 

A m(Af) = max min ||A/(u)|| 2 , to = 0,1,2,... (15) 

The following theorem, which was established in [6] , connects approximation numbers and singular numbers 
of Af with the value of Problem RAB(n): 

Theorem 1 [6, thm. 8.1] Let f £ H 2,0 and Af : H°° — > H 2 ’ 0 the Hankel operator with symbol f. For each 
integer n > 0, the following equalities hold: 


^2 (/*) l,n) — ^n(Af') — A2n+1 (Af') \ 2 n+ 2 (Aff 


(16) 


Theorem 1 is reminiscent of a famous theorem by Adamjan-Arov-Krein (in short: the AAK theorem) 
characterizing doo(/, rather than d 2 (/, i,n)- To state the result, let us define for / £ L°° the Hankel 

operator Tf by 


r/:iJ 2 —> H 2 ’° 
v 1 ^ P-(fv). 


(17) 


Although the definitions of Af and T/ are formally the same, observe that the domains in (11) and (17) are 
different. The definition of s/c(r/) is still given by (13) except that Af is replaced by T/ and T now ranges 
over linear operators from H 2 into H 2,Q having rank at most k. If in addition / is continuous on T, then T/ is 
compact [33, ch. 1, thm. 5.5]. Then, if we let T} denote the adjoint, f is a compact selfadjoint operator 
from the Hilbert space H 2 into itself and as such it has a complete orthonormal family of eigenvectors called 
the singular vectors of Tf, the associated eigenvalues are none but the squared approximation numbers of 
Tf [16, ch. II, thm. 2.1], and there holds the Courant maxmin principle [45, sec. 22.11a]: 


Sn(T f ) = ^max min ||r / (u)|| 2 , 
" +1 IMh=i 


(18) 


where V ra _|_i is the collection of linear subspaces of H 2 of complex dimension at least n- 1-1. In this Hilbertian 
context, the approximation number s„(F/) is also called the n-th singular value of Tf. We say that a function 
v is associated with a singular value s when v is an eigenvector of F}r f associated with the eigenvalue s 2 : 
v = s 2 r}r f{y). As a particular case of Equation (18) a maximizing vector is just a singular vector associated 
with s 0 (r/)• 
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Theorem 2 (The AAK theorem) [1, thms. 0.1 & 0.2] [33, ch. 4, thm. 1.2] Let f £ L°° and Tf : H 2 —> 
H 2,0 be the Hankel operator with symbol f. For each integer n > 0, it holds that 

doo(f,H™) = s n (T f ). (19) 

If in addition / £ C(T), then Tf is compact and the quantity (19) is also equal to (18). 

The case n = 0 of Theorem 2, i.e. that |||r/||| = doo(f, H°°) was known earlier as Nehari’s theorem. 

If we compare (15) and (16) with (18) and (19) for / £ H 2,0 fl L°° , we see that the main difference 
between best meromorphic approximation with at most n poles in L 2 and in L°° lies with the maximization 
step in (15), which in the L 2 - case must be taken over all compact sets of genus at least 1 2n + 2 and not 
just Euclidean spheres of real dimension 2 n + 1. It follows from [3, thm 1] or [23, thm. 5.3] that B n is 
homeomorphic to § 2 ™ +1 and inspection of the proof reveals that the homeomorphism is odd. Moreover B n 
is weak-* compact in S°° [6, lem. 7.3], therefore B n £ /C^ +2 and from Proposition 1 we see that it is a 
supremizer in (15). 

We mention for completeness a companion to Theorem 1 dealing with min max (not max min): 

Theorem 3 [34] [6, eqn. (78)] Let f £ H 2 ' 0 and Af : H°° —> H 2,0 be the Hankel operator with symbol f. 
For each integer n > 0, the following equality hold: 


d2(f,Ttn-i,n)= min max ||A/(u)|| 2 , 

VFgWti wew 

IMU=i 


( 20 ) 


where W n is the collection of linear subspaces in H°° of (complex) codimension at most n. 

Note that (20) is the exact counterpart for Af of the standard Courant min max principle for T f. 

doo(f,H™) = min max ||r/(u)|| 2 , 

IMh=i 

where X n is the collection of linear subspaces in H 2 of (complex) codimension at most n. 

Using Proposition 1 it is easy to see that if p n -i/q n is a solution to RAB(n), then the subspace (q n /q n )H°°, 
comprised of multiples of q n /q n in H°°, is a minimizing W in (20). In the rest of the paper, we use the 
maximizing step in (15) together with Theorem 1 to derive lower bounds for Problems RAB(n). 


5 Lower bounds 

5.1 Comparing L 2 and L°° meromorphic approximation 

Consider / £ H 2,0 fl L°° and r, r* £ lZ n -i,n with r* a solution to RAB(n), i.e. a best approximant to 
/ in L 2 from 1Z n -i yn . Then ||/ — r *||2 < \\f — r '||2 ■ Now, for any h. £ H°°, Parseval’s theorem gives 
11/ — r ||2 < ||/ — r — h H 2 - Finally, since the L°°-norm dominates the L 2 -norm ||/ — r* — h || 2 < ||/ — r* — h\\oo 
and so we have 

11/ — r*|| 2 < ||/ — (r + /i)||(x)- 

Thus, minimizing over r,h, we find that d 2 (f,TZ n -i, n ) < d oc (/, 77“). However, it is a priori unclear how 
large the gap between the two errors can be. Below, dwelling on Theorems 1 and 2, we derive when / is 
continuous a lower bound in terms of the ratio between L 2 and L°° norms of the singular vectors of the 
Hankel operator T/. 

1 That \ 2 n+i(Af) = A 271 + 2 (A/) ' n (16) i s inessential and due the fact that Af is complex linear whereas the genus is a real 

notion. 



Theorem 4 Let f £ H 2,0 D C(T) and n > 0 an integer. Consider an orthonormal family Vq , • ■ ■ ,v n of 
singular vectors of the Hankel operator Tf (cf. {17)), where Vk is associated to the singular value Sk{Tf). 
Define M n {f) := mm{d 00 {f 1 Hj°)/\\vj\\ 00 , 0 < j < n} if Vj £ H°° for 0 < j < n, and M n (f) := 0 otherwise. 
Then 

^dD=<d 2 {f,n n . hn ). ( 21 ) 

Vn+1 

Proof: if M n {f) = 0, then (21) is trivial. Otherwise, the linear span of {i>o, • • • , v n } over C is a real 2 n + 2- 
dimensional vector space in L 2 C\L °°, and we may endow it either with the L 2 -norm or else with the L°°-norm. 
Let S 2 and Soo indicate the corresponding unit spheres. Identifying a vector with its coordinates, we see that 
S 2 is just S 2rl+1 , and clearly v >->• w/||u||oo is an odd homeomorphism from S 2 onto Soo. Therefore, by the 
Borsuk-Ulam theorem, Soo is a compact set of genus 2n + 2. Now, if we let v £ Soo and write v = Y^j =o \ v i 
while abbreviating Sj (L/) as Sj, we get using , )” to mean Hermitian scalar product on T that 


2 2 


WMvm = (A f (v),Af(v)) = (r f (v),T f (v)) = {T* f T f v,v) = £? =0 |A/| 2 Sj 


> 


1 


n + 1 




u'=o 


>®IEWN. 

o'=o 


n + 1 


> 


M 2 (/) 

n + 1 ’ 


( 22 ) 


where the second line in (22) uses the Schwarz inequality, the definition of M n (f) together with the equality 
Sj(Tf) = doo{f,Hj °) from Theorem 2, the triangle inequality and the fact that ||u||oo = 1- Inequality (21) 
now follows from (22) and Theorem 1. 


The kernels KerA/ and KeiT/ are closed subsets of H°° and H 2 respectively, and clearly KerA/ = 
Kerr f (~l H°°. {cf. definitions (11) and (17)). By a theorem of Beurling [14, ch. II, thm. 7.1], being closed 
and shift-invariant {i.e. invariant under multiplication by the variable z), Kerr j is either trivial ({0} or 
H 2 ) or else consists of all multiples of some inner function j, that is, Kerr/ = )H 2 . In the latter case 
Ker Af = j H°°, in particular Kerr/ and KerA/ are simultaneously nontrivial. In this situation the proof of 
Theorem 4 quickly leads to an improvement of itself as follows. Notations and assumptions being as in the 
theorem, set \\vj\\H°°/KerA f to be +oo if Vj $. H°° and to be the distance from Vj to Ker A/ in H°° otherwise. 
Observe that if ||t>/ 0 \\H°°/KerA f = 0 for some jo £ {0,- • • ,n}, then Vj 0 £ KerF/ which entails that Tf has 
rank at most jo by definition of singular values. It is a theorem of Kronecker [33, ch. 1, cor. 3.2] that this 
happens if and only if / £ Hff, and since / £ H 2,0 D C(T) by assumption we get that / £ 1Z ]0 _i iJO . In 
particular it holds in this case that doo{f,H°°) = d 2 {f. D-j-i,j) = \\vj\\h°°/K eiA f = 0 for all j > jo- Keeping 
this observation in mind, let us define 


Qn{f) ■= min 

0<j<n 


doo{f,H°°) 


(23) 


\\Vj\\H°°/KerA f 

where Q n {f) is to be interpreted as 0 if 11 \\h°°/K erAj = 0 for some jo £ {1,■ ■ ■ ,n} (in which case 
doo{f , Hjff) = 0 as well by what precedes). 

Corollary 1 Theorem j remains valid if M n {f) gets replaced by Q n {f)- 


Proof: we can assume that Vj £ H°° \ Ker Af for 0 < j < n, otherwise Q n {f ) = 0 and there is nothing 
to prove. By the discussion before the corollary, this amounts to say that / ^ Hff. Next, pick e > 0 and 
gj £ Ker Af such that ||u/ — <?/||oo < /Ker A f + £ f° r each j £ {1, • • • , n}. If we let Wj = Vj — gj , then 

Af{wj) = F f{wj) = r f{vj) and the Wj are linearly independent over C. Indeed, if )C/=o ^j w i = 0 with 
A j 0 ^ 0, applying / yields X)/=o ^i s /(r f) v o = ^ an< i since the Vj are linearly independent we have that 
Sj 0 {T f ) = 0; thus, by the AAK theorem, we get that / £ H°f) C contrary to our initial assumption. 
Replacing now vj by Wj in the proof of Theorem 4 and using that Tf{wj) = Tf{vj ), we obtain instead of 
(22) that, whenever w = X)/=o ^ j w j is such that HrrHoo = 1, then 


\\A f {w )||I > 


1 

n + 1 


min 

0<j<n 


doo{f,H°°) 

\\ v j\\H°° /KerAf + £ 


2 
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Thus, letting e go to 0, we get the desired result from Theorem 1 again. 

■ 

Theorem 4 is useful only if we have a fair appraisal of M n (f). The latter is delicate to estimate in general, 
but in the following subsections we point out three cases where this can be done in different guises. They 
are: the case of a general rational function which can be approached numerically; the case of a Blaschke 
product where estimates can be given in terms of the zeros; the case of Cauchy integrals over hyperbolic 
geodesic arcs in which boundedness of M n (f) can be proved via a careful analysis of formulas behind AAK 
theory, dwelling on the work in [6] . 


5.2 Application to rational functions 


When / is rational, the bounds in Corollary 1 can be numerically computed. As explained in Section 3, the 
general case reduces by partial fraction extension to the special case where / £ ft 2,0 , the detail of which is 
carried out below. 

Write / = p/q where p £ Vn- i, q €E Vn is monic with all roots in D, and p, q are coprime as polynomials. 
Let us write 

q(z) = n ” =1 (z - Cfc) 

where each £ B is repeated according to multiplicity. It is clear from definition (17) that KerT f consists 
of those U 2 -functions vanishing at the zeros of q, hence KerT f = ( q/q)H 2 . Its orthogonal complement in H 2 
is (KerT f )-*- = Vn-i/q, an orthonormal basis of which is given according to the Malmquist -Walsh lemma 
by the formulas [28, ch. V, sec 1]: 


e j( z ) 


(1-10 I 2 ) 1/2 *-c» 

i-6* 


1 <j<N, 


(24) 


where the empty product is understood to be 1. The effect of T f on any member of (KerTy) 2 - is easily 
computed upon introducing a £ Vn-i and b £ Vn-i such that the following Bezout relation holds: aq+bq = 
1. Indeed, one has for any u £ Vn-i[z] that 

( pua 

V Q 


R q (pua) 


(25) 


r,(„/?) = pi?S)=P-(^ + ^ 


qq 


q 


where we used that pub/q £ H 2 and, for any polynomial P, R q (P) indicates the remainder of Euclidean 
division of P by q. In particular, we get from (25) that ImT/ = VN-i/q. The Hermitian scalar product on T 
can be computed in several ways for functions in VN-i/q] one which does not use partial fraction expansion 
is as follows. Pick u,v £ Vn-i ■ Observing that z N /q is conjugate to l/q on T and denoting with Q q (P) 
the quotient of Euclidean division of the polynomial P by q (so that P = qQ q (P) + R q (P)), we get since 
aq + bq = 1 that 


,u v, ,z N u , ,z N ub 


q q qq 


v) = (- 


Qq ( 


N \ I Rq(z N ua) x ! ( N \ \ 

z ua) H- - - ,v) = (Qq(z ua),v), 


(26) 


where we used that R q (z N ua)/q £ H 2,0 and z N ub/q £ z N H 2 are both orthogonal to v £ Vn-i by Parseval’s 
theorem. The last term in (26) is now a scalar product between polynomials which can be computed as a 
Euclidean one in the basis {z k ; 0 < k < N — 1}. 

Writing = Uj/q where was defined in (24), we can use (25), (26) to compute the Hermitian 
matrix M = (PJT /(e*), ej) = (P f(ei), T f(ej)), and an orthonormal family of singular vectors Vo, ■ ■ ■ , ujv-i 
associated with the nonzero singular values of T/ is then obtained by diagonalization of M (of course any 
other orthonormal basis of VN-i/q than (e*,) could be used as well). More precisely, the fc-th row of a 
unitary matrix U such that UMU* is diagonal yields coordinates for Vk in the basis ej. The diagonal terms 
are the squared singular values s 2 (T/) for 0 < k < N — 1, which are none but the d 00 (f,H^ > ) by the AAK 
theorem. Moreover, it follows from Nehari’s theorem that 


II H°°/KeiA f —doc(vjq/q,H ) — |||T^ 3 ^ 


(27) 
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and the last term in (27) is the largest singular value of a Hankel operator with rational symbol which can 
be computed in the same manner as indicated above to compute so(r/). 

Thus, we can evaluate Q n defined in (23) for all n, hence also the lower bound on d 2 (f,H n -i,n) given 
by Theorem 4 and Corollary 1. We implemented a prototype algorithm to compute these two bounds. 
Numerical experiments are presented in Section 7. 


5.3 Application to Blaschke products 

In this section, we use Theorem 4 to derive some lower bounds for Problem RA(n) when / is a Blaschke 
product of finite or infinite degree. This last case is instructive to contrast rational approximation in L 2 and 
L°° norms, for on the one hand the value of Problem RA(n) tends to zero as n goes large (since rational 
functions are dense in it 2 ), while on the other hand / cannot be approximated “at all” by rational functions 
in H °°, i.e. zero is a best uniform approximant. This follows from the lemma below which is not easy to 
locate in the literature. 

Lemma 1 Let b be a Blaschke product and n be a positive integer which is strictly less than the degree of b 
(if b has infinite degree the assumption is void). Then 

doo(b,Tl n , n ) = Moo = 1. (28) 

Proof: clearly d oa (lo,'R. n ^ n ) < 1 for zero is a candidate approximant. Moreover, if r £ V n , n D H°° then 
r £ Hff. Therefore, upon conjugating, we get d 100 (b. 1Z n . n ) > doo(/b,Hff) and it is enough to show the latter 
is at least 1, hence in fact equal to 1. 

Assume first that b has finite degree d , and write b = qd/qd where qd £ Vd has zeros in D only. Then 
b = qd/qd , and the kernel of T^ is bH 2 whose orthogonal complement in H 2 is (KerT^)- 1 - = Vd-i/qd as 
pointed out in the previous section. Now, if pd-i £ Vd- i, then T^pd-i/qd) = Pd-i/qd so that T^ is an 
isometry from (KerTj,)* 1 - onto its image. Consequently the first d singular values of T^ are equal to 1 (the 
remaining ones being zero). That d oa (b 1 Hff) = 1 now follows from the AAK theorem and the fact that 
n < d — 1. 

Assume next that b has infinite degree. We can write b = 6„+i&oo where b n +i has degree n + 1 and b^ 
has infinite degree. If g £ Hff then also b^g £ Hff , and since |&oo| = 1 a.e. on T, we get by the first part 
of the proof that 

\\i >-slloo = ll&oo^- fcooslloo = \\bn+i - boogWoo > 1, 5£^, (29) 

hence d ao i/b,H'^) > 1, as desired. 


We turn to the main result of this section: 


Theorem 5 Let b be a Blaschke product, of finite or infinite degree. Let us arrange its zeros into a (finite 
or infinite) sequence Ci>C2>"' i where each Q is repeated according to its multiplicity and the corresponding 
sequence of moduli is nondecreasing: |<(i| < IC 2 I < For each positive integer n strictly less than the 

degree of b (if b has infinite degree the assumption is void), it holds that 


and also that 


(l ~ ICn+l| 2 ) 

y/n+ 1 


1/2 


< d 2 (b 1 K n , n ) 


E— L - 


Ypi I <d2(b,H nin ). 


(30) 


(31) 


Proof: assume first that b has finite degree d , so that b £ C(T), and write b = qd/qd where qd £ Vd has zeros 
in D only. By the equivalence between Problem RA(n) and RAN(u) discussed in Section 3, we know that 


d2(b, lZ n ^ n ) = d 2 ((6 - 6(0)), U n - 1,„). 
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Now, the Hankel operators and r^_^y coincide and we saw in the proof of Lemma 1 that T^ is an 

isometry from (KerL^)* 1 = Vd-i/qd onto IrnTj, = Vd-i/Qd- Hence the ej given by (24) for 1 < j < d form an 
orthonormal family of d > n+ 1 singular vectors associated with the singular value 1. By the AAK theorem 
it follows that doo(b, H°°) = 1 for 0 < j < d — 1, and since He^loo = (1 — |Ci| 2 ) _1 ^ 2 j estimate (30) follows at 
once from Theorem 4 upon choosing Vj = e.j+i for 0 < j < n. 

Next, if we let Wj = o e2lirfcj ' ^ n+1 ' > Vk)/(n + l ) 1 / 2 for 0 < j < n, we get another orthonormal family 

of n + 1 singular vectors associated with the singular value 1 , and clearly ||u>j||oo < (n + l ) -1 / 2 Sl-^o (1 — 
|C fc | 2 )- 1/2 for all j. Estimate (31) now follows from Theorem 4 again upon replacing the previous Vj by Wj. 

If now b is infinite and k > 0 is the multiplicity of the zero at the origin, we can write ( 6 ) for some 
constant c of unit modulus. Let us define b m = cz m if m < k and 


b m {z) 


<** n 

l=k -\-1 


~ Cl 

101 


Z-Cl 

l -Clz’ 


m > k. 


(32) 


The sequence of Blaschke products {b m } converges to b pointwise on B, and since it is bounded it must also 
converge weakly to b in H 2 . Since b m and b have norm 1, the limit of the norms is the norm of the weak 
limit, hence the convergence is actually strong in H 2 [9, Theorem 3.32]. Consequently 

lim ^(^mi^n.u) — ^2 (^i ^-n.n); 

m—too 

and since estimates (30), (31) depend only of the first n . + 1 zeros of b they remain valid in the limit. 


In view of Corollary 1, the conclusion of Theorem 5 can be sharpened upon replacing in the proof H'UjHoo 
and ||ur,||oo by |||rg v ||| and |||r^ |||. Computations become more involved but in any case cannot increase 
the left hand side of (30) and (31) by more than a factor 2. Incidentally, for q n £ Vn having all roots in 
D, it seems to be an open question which L 2 -orthonormal bases of V n -\/q n have minimax L°°-norm. Using 
such bases instead of ej in the proof of Theorem 5 may improve on the result. 

Since (7) is necessary and sufficient for {£;} to be the zero set of a Blaschke product, an immediate 
corollary to Theorem 5 is: 


Corollary 2 Whenever a n 
product b such that 


is a nonincreasing sequence in ( 0 , 1 ] 


a 


1/2 

n+1 


y/n+ 1 


< inf II b 

T-e7?.„,„n h 2 


r \\2, 


such that T, n a n < oo, there is a Blaschke 


n £ N, 


(33) 


and also 



< inf || b 
ren n , n nH 2 


rh, 


neN. 


(34) 


5.4 Application to Cauchy integrals on hyperbolic geodesics 

Recall that geodesic lines for the hyperbolic metric in E) are radii and circular arcs orthogonal to T [14, ch. 
I]. By definition, a hyperbolic geodesic segment is a compact and connected subset thereof. Alternatively, a 
hyperbolic geodesic segment is the image of a real segment [a, b] C (0,1) under an automorphism of the disk 
( i.e. a Mobius transformation of the type z a(z — zo)/(l — Zqz) with |a| = 1 and zq £ D, in other words 
a Blaschke product of degree 1). Below is a nonstandard characterization of hyperbolic geodesic segments 
which is analytic in nature. We will not use the “if” part but is is interesting in itself. 

Lemma 2 A C 1 -smooth, closed Jordan arc 7 C D is a hyperbolic geodesic segment if, and only if there is a 
constant C = C( 7 ) > 0 such that, to each g £ H 2 , there is h € H 2 with h\j = g | 7 and ||/i ||2 < CHgl^- If g is 
continuous on B, so is h. 
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Proof: if 7 is hyperbolic geodesic segment, then it is the image of a real segment under an automorphism 
ip of D and h(z) = (jo p){z) o tp- 1 does the job. Conversely, if 7 is a C 1 -smooth closed Jordan arc in 
D with endpoints zi, Z 2 and if there exists a constant C = C( 7 ) as in the statement of the lemma, then 
the proof of [ 6 , thm. 10.1] applies (upon trading the geodesic arc G for 7 in that proof) to show that 7 
consists exactly of non-isolated points of the cluster set, as n ranges over N, of poles of best approximants 
to ((z — zi)(z — Z 2 ))- 1//2 £ G(T) from Hff. Because this characterization depends only on z\, Z 2 , it follows 
that 7 must be the geodesic arc joining them. 


In this section, we will consider functions of the form 


/w _ ‘ / *«U 

2*7T Jq z- 4 


(35) 


where: 

(HI) G C D is a geodesic segment, 

(H2) h is a complex-valued function on G, summable with respect to arclength, having continuous argument 
except possibly for finitely many jumps of amplitude 7 r. 

The prototype of such a function is one which is analytic over ID except for two branchpoints of order strictly 
greater than -1. Indeed, by Cauchy formula, such a function can be written as the Cauchy integral, on any 
smooth cut connecting the branchpoints, of the jump of the function across the cut. This jump is locally 
analytic and has continuous argument on the cut (up to the branchpoints by Puiseux expansion), except 
at the zeros that the jump may have on this cut where the argument has left and right limits which differ 
by kn if k is the order of the zero. Choosing the hyperbolic geodesic cut, we get representation (35). It 
may seem artificial to favor the hyperbolic geodesic segment linking the branchpoints among all possible 
cuts. However, this one turns out to attract almost all poles of best rational approximants (see [ 8 ] for this 
and generalizations to finitely many branchpoints) and also of best meromorphic approximants (see [ 6 , thm. 
10.1] and Corollary 3 below), which makes it in some sense the natural singular set of the function. 

We need additional facts from AAK theory that shed light on singular vectors of Hankel operators with 
continuous symbol. They apply in particular to Tf when / is of the form (35). 


For / £ C(T) and n > 0, a best approximant g n to / from 
[33, ch. 4, thm 1.3] which is given by 


in L°° uniquely exists [1, thm. 1.3] 


9n — 


P +{fVn 


f ~9n = 


IVCtfn) _ P ~{fv n ) 


(36) 


where v n is any singular vector of Tf associated with s n (Ty); moreover, the error function / — g n has 
constant modulus s„(T/) a.e. on T [1, thm. 1.3] [33, ch. 4, sec. 1, eqn. (1.12)]. In particular, (36) 
entails that the ratios P ±(fv n )/v n are independent of which singular vector v n associated with s n (r/) 
is used; this is remarkable for if s n (T f) has multiplicity /i, then the union of { 0 } and of all associated 
singular vectors is a vector space of complex dimension p. 

When / £ G(T), the inner factor of a singular vector of Tf is a finite Blaschke product. More precisely, 
keeping notations as in the previous item and letting in addition m = m{n) be the smallest non-negative 
integer such that s m (T/) = s n (T/), the singular vector v n may be inner-outer factorized as 


= bb„ 


(37) 


where w n £ H 2 is outer and b m £ B m is a Blaschke product of exact degree m with zeros the poles of 
9n (= <?m), while b is a finite Blaschke product whose zeros are also zeros of P + (fv n ). Moreover, with 
b , b m and w n as in (37), it holds that 

T/C^X 2 ) = s n(T/) z~ 1 b m (l/z)j(l/z)w n (l/z), 


> 1 


(38) 
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where j is a finite Blaschke product such that jb £ -B^-i and p is the multiplicity of a n (Tf) [1, thm. 
1 - 2 ]- 

• Assumptions and notations being as in the previous items, let v n be a singular vector ofTf associated 
with s„(r f) and (37) be its inner-outer factorization. We claim that b m w n is also a singular vector of 
r f associated with s„(T f). Indeed, we know from the previous item that g n = b~ n [ h for some h £ H°°. 
Since ||/— /i/£> m ||oo = \\fb m — /i||oo, the fact that g n is a best approximant to / from H™ entails that h is 
the best approximant to fb m £ C(T) from H °°, hence 11|T/& m 111 = ||/ — /i/& m ||oo = Sn(T/) by the AAK 
theorem. Taking into account that T f bm (u) = Tf(b m u) for u £ H 2 , and also that TJ($) = P+(/$) for 
$ £ H 2 ’ 0 , while using that b m H 2 ’ 0 C H 2,0 and P+ + P_ = Id, we now compute 

T* fbm T fbm (bw n ) =r* fbm T f (v n ) = P+(jb^ T f (v n j) = P + (b m P + (fT f (v n ))) 

=p+ (b m r* f T f(v n )) = sl(T f )P + ( b m v n ) = sl{Tf)bw n . 

This shows that bw n is a maximizing vector of T f bm . Next, we observe that 

ll r /b m (^n)||2 = ||P_ (6T /bm («;„)) || 2 < ||r /6m (iu„)||2 (39) 

because multiplication by b is an isometry and anti-analytic projection is a contraction in L 2 . Since 
11biUn11 2 = ||rc n || 2 : we conclude from (39) that w n is in turn a maximizing vector of T f bm and that 
equality must hold throughout in this equation. In other words bTf bm {w n ) £ H 2 ’ 0 , which implies 
easily that bTfb m {w n ) = T fb m (bw n ). Consequently 

r}T/(& m u; n ) = P+ (fbbTf(b m w n )) = P + (bP + (fT f(bb m w n ))) (40) 

= P+ (bT f Tf{Vm)) = Sn(r/)P+ (bv n ) = S n (T f)b m W n - 


This proves the claim. 

We now assume that / has the form (35). Using (4) to express definition (17) of the Hankel operator, 
then inserting (35) and using successively Fubini’s theorem and the residue formula, we obtain: 


Tf{v n ){z) = 


= (. 


V 2*7T 




dC = 


2in 






> 1 . 


(41) 


In particular T f{v n ) extends analytically from C \ D to C \ G, and Equation (38) becomes 

Sn(Pf) z~ 1 b m (l/z)j{l/z)w n (l/z) = dg, \z\ > 1. (42) 

Multiplying the restriction of (42) to z £ T by b m j and then taking anti-analytic projection again gives us 
after a similar computation: 


s n (T f )z 1 w n (l/z) 


1 f ji&bhiOH&WniZMZ) 
2in J G z-£ 


l~l > !. 


(43) 


where we took into account (37). Equation (43) entails that in turn w n ( cf. (3)) extends analytically from 
C \ D to C \ G, or equivalently that w n extends analytically from B to C \ G , where G is the reflection 
of G across T. 

We can now establish a technical result which is the key for applying Theorem 4 to functions of the form 
(35). Recall that a family of analytic functions in an open set 11 C C is said to be normal if it is uniformly 
bounded on every compact subset of U. Equivalently, a normal family of analytic functions is one which is 
relatively compact for the topology of locally uniform convergence in fl. 
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Proposition 2 Let f assume the form (35) where hypotheses H1-H2 do hold, and {'I’njneN be a sequence of 
singular vectors ofTf such that ||f ra ||2 = 1 for all n. Denote by w n the outer factor of v n . Then, {w n }nGN 
is a normal family in C \ G 


Proof: we already pointed out that w n is analytic in C \ G . According to Equation (37), the inner-outer 
factorization of v n is of the form v n = bb m w n , and we know from a previous claim (c/. Equation (40)) that 
b m w n is another singular vector of Tf associated with £„(]?/) having the same outer factor w n . Hence we 
can replace v n by b m w n (in other words, we may - and we shall - assume that 6=1 and write v n = b m w n ). 
For correctness, one should of course write m(n) throughout, but we drop the dependence of m on n for 
simplicity. 

To prove that w n is bounded independently of n on each compact subset of C \ G , we parallel the 
argument of [ 6 , thm. 10 . 1 ]. 

Let 1 1 — y aft) parametrize G with an automorphism a of D as t. ranges over a real segment [a, 6 ]. Then t H > 
a'(t) has continuous argument. Let /3i be a finite Blaschke product with real coefficients vanishing precisely 
at the jumps of amplitude tt that t <—> arg h{a{t)) may have on [a, 6 ] (if h(a(t)) is continuous we simply put 
f) = 1). Then t >->• arg(/3i(f)/i(a(t))) is continuous by our assumptions on h. Thus, by Mergelyan’s theorem, 
there is a polynomial T which is real valued on [a, 6 ] and such that |T(f) + arga / (f) + arg(/3i(f)/i(a;(f)))| < n/3 
for t £ [a, 6 ]. In invariant form, this means that the function H = P o a -1 £ H°° fl G(T) is real valued on G 
and moreover that 




e iH{0 mmdf | < 2 Re (e iH ^ /3(0MCK) , 


£ e G, 


(44) 


where /3 = /3i o a 1 is in turn a finite Blaschke product which is real-valued on G. Notice that H and /3 
depend only on / and not on n. 

1/2 

In another connection, since w n has no zero in ID, it has a well-defined square root w n £ H°°. Note, since 
1111 2 = ll^nlb = 1 by assumption, that ||uv / 2 ||2 = ||a ; n ||^ 2 < 1 by the Schwarz inequality. Appealing to 
Lemma 2, let H n £ H 2 take conjugate values to b m w\j 2 on G, with 11 11 2 < G|| 6 m z 7/ 2 ||2 = G||«;i l / 2 ||2 < G. 

Note that H n is continuous on ID since b m and w n are. For j as in (38) (j depends on v n but we drop this 
dependence), consider the contour integral 


H n (0 m b m (OM) f (45) 

2l7r Jt Wn' (0 € 

where it should be observed that the integrand is continuous even though w n may have zeros on T (of course 
at such points w n /wn “ is understood to be 0). In view of (42), this integrand extends analytically on ID\G, 
hence we may rewrite (45) as an integral over the circle T r = {z : \z\ = r} where r £ (0,1) is close enough 
to 1 that T r encompasses G. Then, substituting (42) and using again Fubini’s theorem and the Cauchy 
formula (which is permitted since w n (z) does not vanish for \z\ < r), the integral (45) transforms into 


1 

(2*7t) 2 




Vn(Qh(() 

e-c 7 


e* H ^ H n {om 


1/2 

W n ' 


(0 


df 


1 

2*71 


i -^jT^-HCyn^HniOmdC 
Jg w D 2 ^) 

f b m (CWn /2 (OHOe iH{0 H n { 0 /3(C) dC, 


where we took (37) into account. Altogether, by the construction of H n , we deduce that 


^ f e im) H n {0mbrn(m = ^- [ \b 2 m(0 w n(C)|M0 /3(C)e lff(C) dC- (46) 

2*7T J T Wn' (£) C 2*7 T J G ' 


2*7T 

By (44), we get on the one hand that 

4tt Jg 


b 2 m {()w n (()/3(0H() d|C| < / K(0w n (0\mh(0e iH{C) 


d(. 


(47) 
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On the other hand, since /3, b m ,j are Blaschke products while Huy/ 2 ^ < 1 and ||il n ||2 < G, we see from the 
Schwarz inequality that 


Wn (£) S 


2*7r 

Therefore, in view of (47), (46), and (48), we get that 

1 

27T 


< Cs n (Tf) ||e iif Hoc. 


b 2 m(()wn(0mH0 rf|CI < 2Ca n (T f ) He 1 " || 


(48) 


(49) 


Now, if we multiply (43) (where b = 1) by /3 and apply P to this product, the computation based on 
Fubini’s theorem and Cauchy formula that led us to (41) and (43) yields 


s n {T f )P^(/3w n )(z) = — 






\z\ > 1. 


(50) 


Equation (50) entails that P_(/3*i; n ) extends analytically to C\G and, as |j| < 1 in D since it is a Blaschke 
product, it follows from (49) and (50) that 


|P-(/?*n)(z)| < 2C\\e lH \\ 00 (inf |z-C|) 

CtCr 


-1 


z G C\G. 


This proves that \P—(0w n )\ is uniformly bounded with respect to n on every compact subset of C \ G. In 
another connection, observe from (4) that P+(/3w n ) is uniformly bounded with respect to n on compact 
subsets of ID) \ G, because ||/M n ||2 = ||tin ||2 = 1- Adding up, we get that fiw n is uniformly bounded with 
respect to n on compact subsets of ID) \ G. Since |/3|, which is a finite Blaschke product with all its zeros on 
G, is bounded from below on compact subsets of C \ G, we thus conclude that {w„} is normal in C \ G. By 
reflection across T, normality of {«;„} in C \ G follows, as desired. ■ 

The following corollary to Proposition 2 is worth pointing out as it shows in a rather strong sense that 
most of the poles of best L°° meromorphic approximants to / as in (35) asymptotically cluster to G. 

Corollary 3 Let f assume the form (35) where hypotheses H1-H2 do hold. Denote by g n the best approx- 
imant to f from Hff in L°°. To each neighborhood V(G) of G, there are no, No £ N such that, if n > no, 
then g n has at most No poles outside V(G), counting multiplicity. 

Proof: we make notations as in the proof of Proposition 2. We noticed already before the latter that w n is 
analytic in C \ G. In addition, it is clear that b m {l/z) = l/b m {z) (resp. j{l/z) = 1 /j(z)) since b m (resp. 
j) is unimodular on T. Hence 6 m (l/z) (resp. j(l/z )) is meromorphic in C with poles at the zeros of b m 
(resp. of j) and no zero in ID>. Since the right hand side of (42) is analytic in C \ G, we conclude that every 
zero of b m (and of j) which does not lie on G is a zero of w n with same or greater multiplicity. Now, by 
Proposition 2, every subsequence w nk , has a subsequence w nk( converging locally uniformly in C\G to some 
analytic function w which is not the zero function because ||*h ||2 = lim^oo ||uVi fe< ||2 = 1 - In particular w has 
only finitely zeros z 1 • • • , Zn of respective multiplicities /jl\, ■ ■ ■ , /*jv in HJ\ Vg- Thus, by the Rouche theorem, 
Wn k( has exactly pj zeros in the neighborhood of Zj for £ large enough, counting multiplicities, and no other 
zero in in D \ Vg- Consequently every subsequence of {*hn} has boundedly many zeros in D \ Vg, which 
implies the desired conclusion as poles of g m which do not lie on G are zeros of w n by the first part of the 
proof. 


The main result of this section is the following. 

Theorem 6 Let f assume the form (35) where hypotheses H1-H2 do hold. Then, 

r, d oa (f, 1L n -i,n) ^ ^ d’ 00 (f,H n ) ^ j / f n ^ 

c>l- , , .. - S C/2- , ; — S «2U, 


y/n + 1 


y/n + 1 


where C\,C 2 are strictly positive constants depending on f but not on n. 


( 51 ) 
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Proof: if v n is a singular vector of Tf associated with s n (T/), normalized so that \\v n \\2 = 1, and if w n is 
the outer factor of v n , we deduce from Proposition 2 that ||w ra ||oo = Halloo is bounded independently of n. 
Hence the second inequality in (51) follows from Theorem 4. 

To prove the first inequality, we must show that 

doo(/,^- 1 ,n)<C'd 00 (/,ff“) (52) 


for some constant C independent of n. Let g n be a best approximant to / from Hff in L°°, and write 
g n = r n + h n where r n G 1Z n -i,n D H°°’ 0 while h n G H°°. Note that / — r n € H 00 ’ 0 , hence h n = P+(/ — g n ). 
Obviously it holds that d 00 (f,'JZ n -i tn ) < ||/ — r n ||oo, therefore, it is enough to check that \\f — rnjloo < 
Cd 00 (f, Hff) in order to establish (52). Now, by the triangle inequality, we get that 


11/ - r n ||oo < 11/ - gn || + Halloo = doo(/, Hff) + ||P+(/ - 5n )lloc 


and we are left to prove that ||P+(/ — g ra )||oo < Cd 00 (f, H™). Let v n be a singular vector of T /, associated 
with s„(r/), having inner-outer factorization v n = b m w n , where b m G B m vanishes exactly at the poles of 
g n and w n is outer; this is possible by a previous claim (c/. (40)). Here and below, we should write for 
correctness m = m(n ), but we drop the dependence of m on n for simplicity. From (36) and (38), we gather 
that 

P+(/ - gn) = s„{T f ) P + (b 2 m jw n /w n ) , 

where we also dropped the dependence of j on v n , and since s„(T/) = d cxi (f, Hff) it remains to establish 
that ||P + (b^jvjn/wn) Hoc is bounded independently of n. For this, it is enough to show that from any 
subsequence nj. one can extract a subsequence rik f for which the property holds. Appealing to Proposition 2 
as in the proof of Corollary 3, we can extract from {w nk } a subsequence {w nke } converging locally uniformly 

to some w, analytic in C\G , which is not the zero function. Let w have N zeros lying on T, say z i, ■ • ■ , zjv, 
where multiplicities are accounted by repetition and it is understood if TV = 0 that {zj} is the empty set. 

Pick e > 0 small enough that the circle Ti +£ does not meet G 1 and w has no other zeros than zi, ■ ■ ■ ,zn 
in the corona C e = {z : 1 < \z\ < 1 + e}. When £ is large enough, by the Rouche theorem, w nkt has exactly 
N zeros z,Z n,i in C £ , counting multiplicities with repetition, and {zjA converges to {zj} as a set 
when £ —> +oo (recall that w nk is outer hence has no zero in B). We label the Zj t e so that, say Zj t e G T for 
1 < j < S£ and Zj t i T for si + 1 < j < N. Define 

PnAO = Qn-scAO = n |L s ,+1(/ - 

and let us write w nk( (0 = u AO PnAO where u^(0 is analytic in C \ G and zero-free in C e . By the 
maximum principle, Uj t e converges to w(z)/ nA =1 (;: — Zj) locally uniformly in C \ G 1 . Clearly, 


tin (C) _ -prSf f - \ c~N ^3 AO Qn-biAQ 

^n(s) Qn~ 


(53) 


where we observe that 6 jv- s « = (—Zi t i) Qn-si/IQn-si/ lies in Bjq_ si and that Uj^/Uj^ is continuous 
and bounded independently of £ on C E as well as analytic in the interior of C e . Now, put AO — £ an( i let us 
write 


P+ ( b 2 m jtim/w m ) = b N - se P+ (/ 3 N b 2 m ju e /uf ;) + P+ ( b N - se P _ (, 8 N b 2 m jui/ui )) . (54) 


Recalling that b m ( 1/z) = 1 /b m (z) and j(l/z) = 1 /j(z), we deduce from (4) 


p + ( 0 N b 2 m jtii/ui) (z) 


1 f 1 _ ue(Q df 

2m Jf 6 ^ ( 0/(0 - z) 


M < 


(55) 


and by Cauchy’s theorem we can deform the contour of integration to Ti +e without changing the value of 
the integral: 


P+ (P N b 2 m ju e /w) (z) 


1 f 1 _ ue(Q df 

2*tt i Tl+e 6^(0/(0 ~ z) ’ 


N<1- 


(56) 
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The integral in the right hand side of (56) is now bounded in modulus, independently of £ and z £ B, because 
\b m \ > 1 and \j\ > 1 on € \ D, while e < |£ — z\ and ue/ug is uniformly bounded on Ti_|_ e . Thus, the first 
summand in the right hand side of (54) is bounded in L°°, independently of £, because £ Bjsr- ae . 

To see that the second summand is also bounded, we put T = P_ (i3 N b 2 n ju?/u(^J and we notice that 
Halloo is bounded independently of £ because P N b 2 n jui/u£ is unimodular on T and we just saw from (56) 
that ||P+ (P N b 2 m ju t /ut) ||oo is bounded independently of t. Next, we observe that this second summand is 
P+(&at_^’T) and that it lies in 'R-n-s 1 -i,n-s 1 because for (gTwe have: 

Q ^N- s { P p +^s e m) = QnsA mm - 

and both summands on the right lie in H 2 ’ 0 whence QN- se P + (b N _ Sf 'i>) £ Tn-si- i- We now appeal to 
Grigoryan’s theorem, saying that if P_($) £ 1Zd-i,d then ||P_(4>)|| 00 < cdH^Hoo for some absolute constant 
c, see [33, eqn. (6.1)]. As P+(4>) = P_($) for any function <f>, it implies since the check operation preserves 
lZN-s e -i,N-s t and the L°° norm that 

||P + (6 JV - s ^)||oo = ||P-(6 JV - s ^)||oc < C(N - s*)||& JV _ a< *|| 00 = c(N - s^ll^lloo. 

This achieves the proof. 

■ 

The authors conjecture that Theorem 6 carries over to Cauchy integrals of the form (35) where G is 
a so-called symmetric contour for the Green potential in D (c/. [40, thm 1] for details), the prototype 

of which is an analytic function with finitely many branchpoints of order greater than —1 in the disk. 
For such functions, more generally even if branchpoints have arbitrary order, it was proved in [17] that 
lim n -» 00 d 00 (/,^r,-i,n) 1/n = exp{— 2/C} where C is the condenser capacity of the pair (T, G). The same 
n-th root estimate holds for d 2 (f,'R- n -i, n ) [8, cor. 8], and more generally for the distance from / to TZ n -i >n 
in L p when 1 < p < oo [42]. Inequality (51) compares d 2 (f,'R- n -i,n) and doo(/> T^-n-i,n) in a much stronger 
sense, but still one may wonder if the factor 1/y/n + 1 is really needed. In the special case of Markov 
functions, i.e. of Cauchy integrals of positive densities on a segment, the results in [7] show that this factor 
is in fact superfluous. 

6 Linearized errors 

Given / £ H 2,0 , p n -\ £ V n -i, q n £ V n and a (complex) weight function w £ H°°, the linearized error 
associated with p, q and w in problem RAB(n) is 

£{f,Pn-i,qn,w) := (q n f - p n -i)w. (57) 

It is formally obtained from the error /— p n -i/q n by chasing denominator q n and multiplying by the weight. 
In applied sciences, problem RAB(n) and weighted variants thereof are of great importance to model time 
series as well as to identify linear dynamical systems 2 , e.g. in modal analysis of mechanical structures or 
in frequency analysis of microwave devices [22, 24, 27, 35, 30]. The importance of the L 2 norm in this 
context stems from its statistical interpretation as a variance. Because RAB(n) (or equivalently MA(n)) is 
a difficult non convex problem, several approaches to system identification in engineering have been based 
on linearization. Most popular in this connection are two closely related heuristics, namely the Steiglitz- 
McBride method [41, 35, 36] and the vector fitting method [21, 11]. These are iterative procedures, first 
choosing w = l/ir n where n n £ V n is monic with no root on T, then minimizing \\C(f,p n -i,qn,w)\\2 with 
respect to p n -i as well as q n: the latter being normalized so as to be monic 3 (which yields a convex problem). 

2 For continuous time systems, rational approximation is performed on the imaginary axis rather than the circle. This, is 
equivalent to the present setting thanks to the isometry / i —> \/2f((z + 1 )/(z — l))/(z — 1) mapping H 2 onto the Hardy space 
of {Rez > 0} while preserving rationality and the degree. 

3 What we describe here is the Steiglitz-McBride method, although we should mention that the criterion used is often a 
discretized version of ||£(/,p n _i, q n , 1 /‘7r ra ,)11 2 obtained from pointwise values on T. The vector fitting method is essentially a 
rewriting of the Steiglitz-McBride procedure where rational functions are parametrized in pole-residue form. 
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Subsequently, one replaces w by l/g°, where q° is the optimal q n , and repeats the previous steps until some 
fixed point is reached. Such procedures are prompted by the easy observation that if / £ R n -i >n , then 
the value of the problem is zero from the first iteration already. Accordingly, convergence was studied in a 
classical stochastic setting for system identification, where / £ R n -i, n D H 2: ° is perturbed by white noise 
(the noise is then constitutive of the model), but such heuristics do not converge in general when / ^ 

[37, 25, 36], 

Our purpose here is not to discuss these techniques, nor to compare them with dedicated optimization 
algorithms [13, 30], but rather to stress a link between the value of RAB(n) and the minimization of linearized 
errors. 

We consider weights of the form w = 1/iTn where 7 r n £ V n is a polynomial having no root on T. When 
minimizing the linearized error, Theorem 1 suggests a specific normalization for q n : let us define 

P *.„ := Un £ Vn : ||W7T n ||oo = !}■ (58) 


Then, the following result holds. 

Theorem 7 Let f £ H 2,0 and ir n £ V n , with Z(n n ) (~l T = 0. Then 

d 2 (f,V- n -i,n) = d 2 {f,H 2 ) > min ||£(/,f>„-i, g„,l/7r n )|| 2 . (59) 

Qn €.Pn n 
Pn-lSfn-l 


Proof: we may assume without loss of generality that Z(7r n ) C C\l, for otherwise we can replace every 
linear factor (z — a) of 7 r n for which a £ B by the linear factor (1 — az) which has reflected zero across T. This 
leaves | 7 r n | unchanged on T, and consequently does not affect the minimization of ||£(/,p ra _i, q n , 1 / 7 r„)|| 2 - 
Now, arguing as we did to obtain (9), we find that Tr n ~P+(fq n /ir n ) is a polynomial of degree at most 
n — 1. If we write p n -i(q n ,n n ) for this polynomial and take into account that Z(ir n ) D D = 0, we see from 
Parseval’s theorem that for fixed q n £ V n the criterion ||£(/,p n -i, Qn, 1 /7Tn)11 gets minimized precisely when 

Vn- 1 = Pn-l(<7n,7T n ), SO that 


min II£(/? Pn—l, Qn, 1/t£i) II 2 
Pn-l^Vn-1 


l|P - 


P/(^/7T„)||2- 


(60) 


Let 


P-TT n •— {Qn/^n • Qn £ P^nf- 


Since 7 r n has no zeros on D, it holds that K nn C <S°°, the unit sphere of H°°. Identifying V n with C n+1 ~ 
K 2 "+ 2 by taking coefficients as coordinates, we see that K^ n is homeomorphic to the Euclidean sphere S 2n+1 
via the map q n /ir n >->• Q'n./1|< 7 n ||2 which is odd. Therefore K, Tn is a compact subset of 5°° of genus 2 n + 2, 
and by (16): 


d 2 (f,Rn-l,n)=d 2 (f,H 2 n )> 


mm 

n,/7V n eK n 


\\A f (q n /n n ) || 2 


(61) 


which is (59) in view of (60). 


It follows easily from a compactness argument that the minimum in the right hand side of (59) is attained. 
However, it not a priori obvious how to compute it for P^ n is not convex. Numerically, this issue can be 
approached as follows. First, we assume without loss of generality that 7 r n has no roots in D, so that (60) 
holds (cf. proof of Theorem 7). Next, for £ £ T, let 

Pnn,£ := {Qn £ Vn ■ H^n/^nlloo = 1; 9n(£) = ^n(£)}- 

Observe that P Wn ^ is never empty when Tr n has no zero on T. Indeed, for small e > 0, it holds that 
| 7 r n (e * e )| 2 — \e{e l ° — £)| 2 > 0 hence, by Fejer-Riesz factorization (see Lemma 4 to come), there is a polynomial 
q n with |g„| < | 7 r„| on T and \q n {0\ = Kn(£)l- Thus, 5 „ 7 r„(£)/g n (£) lies in P^. Clearly 

= u 5,CeT C P-K n ,i, (62) 
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and multiplying q n by £ £ T cannot change the value of \\Af{q n /i: n )\\ 2 . Therefore it holds that 

min ||£(/,p„_i,g„,l/7r n )|| 2 = min^(£) (63) 

Qn^P-Kn ££T 

Pn — l^'Pn—1 

where the function ?/>(£) is given by (c/. (60)) 

V>(£) = min \\£(fiPn-i,qn,l/ir n )h = min \\ A f(Qn/n n ) H 2 - (64) 

Pn-l£P»-l 

Note that i/>(£) can be computed as the solution of a convex problem for each £, because Pn n ,£ is a convex 
set and ||A/(g„/7r„)|| 2 a quadratic criterion. Granted this ability to evaluate ip pointwise, we discuss below 
how to numerically estimate the minimum in (63). 

Clearly ip is the zero function when / £ lZ n -i,n, for if / = p/q with deg <7 < n we may pick q n = q as 
minimizer in (64). The next lemma describes this minimizer in greater detail when / ^ lZ n - 2 ,n—i- 

Lemma 3 Let f £ H 2,0 and n n £ V n , with Z(TT n ) (~l B = 0. If f ^ lZ n - 2 , n -h then the minimizing q n in 
(64) is unique, has all its roots in B, and exact degree n. 

Proof: assume that q n , 1 and q n , 2 are distinct minimizers, that is, q n ,i,qn ,2 £ Pn n ,£ and ||(< 7^,1 /TTn )||2 = 
\\Af{q n , 2 /^n )\\2 = ip{0- Put qn ,3 = (<7n, i + q n , 2 )/% £ P nn ,t- By strict convexity of the L 2 norm, we get 
Af(q n ,i/TT n ) = Af{q n ^/'K n ) otherwise we would have that || Af(q n ^/Tt n )\\2 < ip(Q which is absurd. Set 
q = qn, 1 — q n ,2 £ Vn- Then Af(q/n n ) = 0 implying by definition of Af that fq/TT n £ H 2 . A fortiori then 
fq £ H 2 , and since / £ H 2: ° we must have that fq is a polynomial of degree at most n — 1, say p. Thus, 
/ = p/q, and as q has a root on T (namely £) the latter must be cancelled by a corresponding root of p. 
Altogether / £ lZ n - 2 ,n-i 1 thereby showing the uniqueness part of the lemma. Let now q n ^ be the unique 
minimizer and b a Blaschke product with poles in Z{q n ^) nC\D. Then bq n ^ £ V n has same modulus as 
q n on T, hence there is ( £ T such that (bq n ^ £ P Wn ,£- Since Qb is a Blaschke product, reasoning as in (39) 
yields \\Af((bq n ^/ir n )\\2 < ||A/(g n ^/7r n )|| 2 so that £ bq n ^ is in turn a minimizer, hence is equal to q n ,£ by 
the uniqueness part just proved. Now, q n ^ 0 since q n ,t(£) = 7r n (£) ^ 0, therefore Q) = 1. Thus, b must be 
a constant, that is to say there cannot be a zero of q n ^ outside B. Finally, assume that degq n ^ < n. Then 
q n ^(z)zt; lies in P-K„,e and, since is a Blaschke product, it follows as before that q n ,£zt; is a minimizer, 
hence it must be equal to q n by uniqueness. This contradiction achieves the proof. 

■ 

We need a continuity property of the Fejer-Riesz factorization that we could not ferret out in the literature. 
Write T n for the space of trigonometric polynomials of degree at most n, i.e. sums of the form ]P|fc|<n ake lke . 
For fixed n, V n and T n have a natural topology induced by any norm. 

Lemma 4 To each nonzero T £ T n such that T > 0 on T, one can associate continuously a unique polyno¬ 
mial q £ V n having no zero in B and such that \q{e lS )\ 2 = T{e l8 ) with g(0) > 0. 

Proof: let Tpf C T n be the closed subset of trigonometric polynomials which are non-negative on T. For 
T £ Tpf , existence of q £ T n such that \q\ 2 = T on T is a classical result known after Fejer and Riesz [38, 
sec. 53]. Since \z — a\ = |1 — za\ for z £ T, clearly q may be chosen zero free in B if T ^ 0. Then, q\ B is outer 
in H°°, for it has no zero and it extends analytically across T [14, ch. II, thms. 6.2 & 6.3]. Thus, formula 
(5) shows that q is uniquely defined by log|g| = logT/2, therefore also by T. Moreover, each coefficient of 
q is a continuous function of logT £ L 1 , because the fc-th coefficient is just the derivative q( k \0)/kl and we 
may differentiate (5) under the integral sign. To achieve the proof, we establish that T H > logT is continuous 
from T + \ {0} into L 1 . 

First, we claim that W i—|| logTj|i is continuous from Tn \ {0} into R. To see this, it is enough to show 
that if T {*4 tends to T in T r f \ {0} as k —> oo, then || logT^ fcf ^[|i tends to || logT||i for some subsequence 
p{ k A By the first part of the proof, we can write T {*4 = |gl fe l| 2 with 

q {k} {z) =q {k} (0)U]f =1 (l- zaj k} ), a {k} £ B, 
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where multiplicities are counted by repetition and the ordering of the roots for each k is arbitrary. Note that 
|gi fc i(0)| is bounded, since by the Schwarz inequality: 


k {fc} (o)| 


b 


q^(e ie )dO 


<\W 


{fell 



Therefore, there is a subsequence q^ k A such that q^ k A( 0) converges to c £ C and a ] kf converges to dj £ D 
for each j 6 {1, ■ ■ ■ , n}. If we let 

q{z) = dl!? =1 (1 -zaj), 

then clearly |gi fe d| 2 converges to |g| 2 almost everywhere on T, so that necessarily \q\ 2 = T. In particular, we 
have that c ^ 0 otherwise T would be identically zero, a contradiction. Now, since log turns products into 
sums, we are left to show that if —> b in D, then 


r 2n 


lim 

k —»+oo 


log 11 — e ie b^\ 


o2-k 


de = 


log |1 — e™b\ de. 


(65) 


When |6| < 1 relation (65) is obvious. If |6| = 1, we may assume by rotational symmetry that 6=1 and 
£ [0,1], in which case (65) follows by dominated convergence from the observation that |1 — b^e lS \ > 

| sin0| for |0| < n/2. This proves the claim. 

Now, if T^ k } tends to T in Tn \ {0}, it is plain that logld fe i converges to logT almost everywhere 
on T and by the previous claim the id-norm of the limit is the limit of the id-norms. Thus, the desired 
id-convergence of logld fc i to logT follows from Egoroff’s theorem [39, ch.3, ex.17]. 


With the help of Lemmas 3 and 4, we now prove that ip is continuous: 

Lemma 5 Let f £ H 2,0 and w n £ V n , with Z( 7r n ) (~l D = 0. Then the map ip defined by (64) is continuous 
on T. 

Proof: if / £ we mentioned that ip = 0 already. Otherwise, dwelling on Lemma 3, let q n ^ indicate 

the unique minimizer in the last term of (64). By definition of P nrni £ we have that < |7r n | on T and 

that q n ,^{C) = 7T„ (f ), in particular q n ^ is bounded independently of £. Thus, from any convergent sequence 
fk —■► C ° n T, we can extract a subsequence ^ for which q n ,£ ke converges uniformly to some q £ V n , and 
passing to the limit we see that q £ P nrl ^ . Given e > 0, we can pick the sequence fk. so that 

lim ip(£k) = l < lim inf ip(() + e, 

k —>+oo 

and by continuity of q n H > \\Af{q n /Tt n )^2 from V n into M we get that 

lim inf ip(C) +£ > l = lim || A f (q n>i * /7r n )|| 2 = || A f (q/ir n )\\ 2 > ip{£). (66) 

<;—>■£ £—>oo *■ 

Since e > 0 was arbitrary, we conclude that ip is lower semi-continuous. To see that ip is in fact continuous, 
it is enough to establish the following claim: to each £ G T and e > 0, there is rj > 0 such that |£ — £| <77 
implies existence of q^ G P-K n ,c. with \\q n ,$, — (?c||oo < £• Indeed, if the claim holds, we get from (64) that when 

I?-Cl < v- 

V'(C) < \\Af(qc/^n)h < \\Af{q nti /nn )\\2 + \\Af{(qc - q n ,t)/i r„)|| 2 < ip(Q + e||//7r„|| 2 , 

and since e was arbitrary we conclude that lim sup^^ ip{C) — V’(C) whence ip is indeed continuous in view 
of (66). 

To establish the claim, observe from Lemma 4 since |7r n | 2 — |(?n,j| 2 is a non-negative trigonometric poly¬ 
nomial of degree at most n on T that 

kn , £ ( e ^)| 2 + |«„, l ( e ")| 2 = | 7 r „( e «)| 2 ( 67 ) 
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where n n ^ £ V n has no root in D and is uniquely defined by (67) together with the normalization n n ^(0) > 0. 
As 7r„(£) = q n x(f), we can write n n ^(z) = (z — £ )Q n -i(z) : and for £ E T we set more generally: 

K n ,c( z ) = ~—= sup \(z — £)Q n -i(z)/ir n (z)\. (68) 

zeT 

Clearly \n n x\ 2 < |7r n | 2 on T so that, by Lemma 4, there is a unique P n £ £ V n having no root in B and 
meeting P n ^ (0) > 0 such that 

\P n ,d ei0 )\ 2 + Kx(e i9 )\ 2 = K(e ie )\ 2 . (69) 

Since q n ^ has all its roots in B and exact degree n by Lemma 3, it follows from (67) and the uniqueness 
part of Lemma 4 that P n £ = cq n £ where c £ T is such that cq n g has positive leading coefficient. Moreover, 
it is easily checked from (68) that \K n (\ 2 is arbitrary close to |ft n) £| 2 in 7^ if |£ — Cl is sufficiently small. 
Therefore, from (69) and the continuity property asserted by Lemma 4, we deduce that P n x is arbitrary 
close to cq„£ in V n if |£ — £1 is sufficiently small. Consequently Q ^ = cP n f is arbitrary close to q n ^ when 
|£ — £| is small enough. Now, by (69) and the definition of , it holds that |Qcl = \PnX | < |7r n | on T, and 
also that |Qc(OI = |-Pn,c(C)| = Kn(£)| because k„^(£) = 0 by construction. Hence, q$ = (ir n {()/Q({())Qc 
lies in P^x, and since Qf(£) —t q n x(0 = n n(0 as £ —> £ we have that (7r„(£)/Qf (£)) —> 1 when £—>■£. 
Thus, just like Qq, the polynomial q £ is arbitrary close to q n ^ when |£ — £| is small enough, which proves 
the claim. 

■ 

To estimate the right hand side (63), it remains to minimize ip(f) over £ £ T, which can be numerically 
performed by dichotomy because T is compact and 1-dimensional while if is continuous by Lemma 5. 

A natural question is whether the lower bound (59) can be sharp. The answer is no except in the trivial 
case where / £ 77„_i jra : 

Proposition 3 Assumptions and notations as in Theorem 7, it holds if f ^ P n ~i,n that 

d 2 (f,TZ n - i,n) > min ||£(/,p n _i, q n , 1/7t„)[| 2 (70) 

9n G J°7r n 

Pn-lB'Pn-l 

Proof: as in the proof of Theorem 7, we may assume that Z(ir n ) C C \ B and then, by (60), the right hand 
side of (70) is equal to min 9n ep„ n II Af(q n /Tt n )\\ 2 . Let q n p be a minimizer of the latter, and b n ,i = q n ,i/Qn,i 
a minimizing Blaschke product in (12). Multiplying q n p and q n ,\ by a real constant, we may assume 
<7n,i £ P-K n ■ We can also multiply q n p by a unimodular constant so that, using (62), there is £o £ T for which 
<7n,o £ Pn„Xo- Now, if (70) is an equality, we get by definition of q n ,o, qn ,l that 

d 2 {f,Hn-l,n) =\\P-(fq n ,o/^n)\\2 < \\P - (f q n ,l / TT n )h 

HI P -(/(3n,l/?n,l)(<fn,l/^n))||2 < ||P- {fq n ,l/q n ,l ) || 2 (71) 

=d 2 (f ,!Z n -l,n) 

where we used in the second inequality that q n ,\/^n £ S°°. Consequently equality holds throughout (71), 
implying in particular that \q n ,i\ = kn| on T. Thus, as both polynomial have no root in B their ratio 
is a unimodular constant, and renormalizing q Hy \ if necessary we may assume that q n ,\ = tt™- Then c = 
9n,i(£o)/'7ra,i(£o) is a unimodular constant such that cg„q £ P nn Xoi and by the uniqueness part in Lemma 3 
we see that the first inequality in (71) can be an equality only if q n fi = cq n p. Altogether q n ,o/Tr n = cb Hj i is 
in turn an optimal Blaschke product in (12). This optimality entails that [6, thm. 8.2] 

AfAfixinfi/Ttn) = P+ ^|A/(^n,o/^n)| qn,o/^nJ- 
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Consequently, letting (u, v) l 2 = Re(u,w), it holds for all q n £ Pn n ,£ 0 that 

(Af(‘ ln,o/ir n ), A f (q nfi /ir n ) - A f (q n /TT n )) L 2 = (A* f A f (q nfi /ir n ), q n p/^n - Qn/^L 2 

= (P+ ^\Af (^njO/^n) | Qn^o/^n^ 5 qn^ol^n 9n/^n) L 2 (72) 

={\Af{q nfi /-K n )\ 2 q n $/-K n , q n ,o/^n - qn/^L 2 
( \Af {qn,o/^n) | 5 1 <7n/<7n,o).L 2 , 

where we used in the third line that (g n ,o — qn)/^n £ H 2 to get rid of P+ and in the last line that 
q n ,o/^n = Kn/qn ,0 On T. 

On the one hand, we see that (72) is non-negative for all q n £ Pn n ,£ 0 because \q n /qn,o\ = \qn/^n\ < 1 on 
T. In fact, it can be made strictly positive: indeed, Af(q H} o/^n) is not the zero function for / ^ lZ n -i, n , and 
as discussed before (62) one can pick q n £ P-n n fy such that \q n /qn,o\{Q < 1 for £ ^ Co on T. On the other 
hand, the fact that q n 0 is a minimizer in the convex problem (64) (where £ is set to £o) implies that (72) is 
nonpositive for all q n £ P-n-n^o [lb, prop. 5.23], a contradiction which concludes the proof. 


7 Numerical results 

In order to study how effective the bounds given by Theorem 4, Corollary 1 and Theorem 7, we wrote a 
prototype implementation in each case and ran it on a few examples. We report in this section the results 
obtained on the following set of functions / £ H 2,0 . For each of them, we consider the problem of best H 2 
approximation by a rational function in lZ n -\ tn with n = 4. 

• Example 1: / : z k>g((102 — 9)/(102 + 9)). 

• Example 2: / is a rational function of degree 5. 

• Example 3: / is a rational function with 20 poles that have been randomly and uniformly chosen 
inside the unit disc. 

• Example 4: / is a rational function with 20 poles that have been randomly and uniformly chosen 
inside the disc of radius 0.2 centered at the origin. 

• Example 5: / is a rational function with 20 poles that have been randomly and uniformly chosen 
inside the annulus of radii 18/20 and 19/20 centered at the origin. 

• Example 6: / is fairly close to a rational function of degree 4. Namely, we chose a rational function 
g of degree 4 and then we obtained / by perturbing each Fourier coefficients of g with a small noise of 
relative error bounded by 0.01. 

• Example 7: f : z i—> exp (—\/{z — 0.9i)) — 1. 

These examples have been chosen so as to exhibit different kind of singularities inside the disc, which may 
or may not be close to the unit circle, in order to cover various situations. 

Before discussing the results, let us say a few words on the implementation. We are using Matlab R2011b. 
For each example, / is actually approximated by a truncated Fourier series /. The order of truncation is 
chosen so as to ensure that / and / agree to at least 40 bits on the unit circle. The bounds of Theorem 4 
and Corollary 1 are computed as described in Section 5.2: indeed, since / is a truncated Fourier series, it can 
be written / = p/q where p £ Vn-i and q = z N . It turns out that, when q is a power of z, the construction 
in Section 5.2 gets simpler since q = 1, whence Bezout relation is just aq + bq = 1 with a = 1 and 6 = 0. We 
handled all examples using this technique, even though in examples 2 to 5 the number N becomes quite large 
(up to 1500) and it would have been more efficient (but would also have required more implementation) to 
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forget about / and to apply the construction in Section 5.2 to the original /. Anyway, even for fairly large 
N, we obtained our results in a few seconds on an Intel Xeon at 2.67GHz, with 4GB of memory. When 
computing the bound of Theorem 4, the norms H^ Hoo are estimated by sampling Vj at 8000 points evenly 
distributed on the unit circle. 

Regarding the bound of Theorem 7, its computation reduces to finding the minimum of +£), for £ £ T (see 
(63) and (64)) as explained in Section 6. For a given £, we evaluate ^(£) by solving a convex optimization 
problem. For this purpose we use CVX, a package for specifying and solving convex programs within 
Matlab [19, 18]. More precisely, we precompute Af(z^~ l /n n ) (j = l...n + 1) in the Fourier basis, i.e. 
we compute a N x {n + 1) matrix M = ( rmj ) such that Af(z^~ 1 /n n ) = Therefore, if 

q n = i a j A* -1 , we have that 




1 cn \ 

WMvnMWi = 

M 

V a n +i j 


Since N is much larger than n, it is convenient to compute a decomposition M = QR , where Q is orthogonal 
and R is upper-triangular. Since ||Mi >||2 = ||i?n ||2 for all vectors v and only the first n + 1 rows of R are 
non-zero, we end up handling a (n + 1) x (n + 1) matrix instead of M. Now, 


V>(£) = min 

Qn £ 7r n , £ 

The set Pn n ,z is convex, but described by infinitely many constraints. Therefore, we consider a set Ti of 50 
points regularly spaced on the unit circle and we define 

n+1 

={ln = Yl a i 2:3-1 '■ qn (0 = 7r ™(£)’ and V C eT l. kn(OI < MO}- 

J=1 

We first use CVX to solve our minimization problem subject to q n £ ^. This gives an optimal polynomial 

qn^ . Next, we construct a set T 2 by adding to Ti the points of T where \qn^ /^n\ reaches a local maximum. 
We then use CVX to solve our minimization problem subject to q n £ P^J ^, where 

n+1 

= {<ln = Yl a J '■ 9n(0 = 7T„(£), V£ G T 2 , |(?n(C)l < ^(C)}- 
i=i 

( 2 ) 

This gives a new optimal polynomial ; , and we repeat the process until we reach a step k where 
max^gT l9« d (C)/ 7r ri(C)l ~ 1 falls below the level of numerical errors produced by CVX. 

It is worth pointing out that, although sufficient in most cases to get an idea of the numerical value 
of +£), this procedure yields no certified estimate of the bound in Theorem 7. Actually, CVX is a user- 
friendly generic software, able to tackle many types of convex optimization problems with a powerful syntax. 
However, it offers little control on the difference between the true mathematical solution and the numerical 
estimate thereof. Moreover, when too many constraints enter the game, it quickly yields no solution at 
all. In addition, it is probably much slower than would be a dedicated tool to solve that particular convex 
problem. The point we want here to make is that accurately estimatig the bound in Theorem 7 {i.e. aiming 
at more than a prototypical illustration of the content of the paper) requires further work. 

The numerical results proper are reported in Table 1 where the second column is (lower bound 

given by Theorem 4), the third column is (lower bound given by Corollary 1), the fourth and fifth 
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columns are min 9 „eP„„,p„_ 1 eT , „_ 1 \\£(f,Pn-i, <7n,l/ 7r n)ll2 (lower bound given by Theorem 7) for two different 
choices of 7r ra . 

For an appraisal of the sharpness of our results, we also ran RARL2 4 , a software tool that tries to compute 
a solution to problem RAB(n). RARL2 looks for local minima of the criterion in a fairly systematic way, 
and returns the best approximant it could find. As a consequence, it gives an upper bound for the value 
of problem RAB(n) which is likely to be tight and therefore interesting to compare with our lower bounds. 
The error ||/ — r ||2 generated by the candidate best approximant r computed by RARL2 is reported in the 
last column. 

The bound given by Theorem 7 has the advantage of allowing the user to choose a weight 7r„ which 
offers extra-flexibility to try to improve the estimate. Yet, it is not obvious how to pick 7r n in general. The 
simplest choice is 7r n = 1 (reported in the fourth column of the table). Another, appealing possibility is to 
put 7r n = q* where q* is the denominator of the rational function computed by RARL2 (since q* has all 
its poles inside the disc, ir n has all its poles outside, as required). The corresponding results are reported in 
the fifth column of the table. 


Example 

Bound of Th. 4 

Bound of Corollary 1 

Bound of Th. 7 

with 7Vn — 1 

Bound of Th. 7 

with 7T n = qf 

RARL2 

1 

2.884744e-3 

2.887532e-3 

4.04e-3 

10.8e-3 

11.5e-3 

2 

7.731880e-2 

7.732037e-2 

12.4e-2 

24.3e-2 

24.72e-2 

3 

2.459346 

2.470149 

2.286 

0.258 

16.6907 

4 

1.234503 

1.234861 

1.94 

1.8 

6.5721 

5 

47.26312 

47.30424 

2.14 

N/A 

178.3152 

6 

2.894007e-3 

2.894380e-3 

9.62e-3 

12.46e-3 

12.5e-3 

7 

1.780707e-4 

1.782276e-4 

0.7977e-4 

6.3409e-4 

6.3742e-4 


Table 1: Numerical results 

As can be seen from the table, the refinement of Corollary 1 with respect to Theorem 4 is almost negligible 
on all examples. The bound given by Theorem 7 with 7r n = 1 is better than Theorem 4 and Corollary 1 in 
4 cases out of 7, but not overly so. Considering that the computation time is generally much longer for this 
bound, this improvement can be considered as rather expensive. In contrast, when 7r„ = q*, the bound of 
Theorem 7 becomes fairly sharp in cases 1,2,6, and 7, which is encouraging. This better bound comes at the 
cost of a longer computation time though, mostly because the convex optimization problems involved with 
this choice of 7r n seem more difficult to solve. In Example 5, for instance, we were not able to obtain reliable 
results from CVX. However, it also appears that choosing 7r n = q* is not always best, and it would be quite 
interesting to further understand which 7 r„ are efficient in this respect. 
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